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This paper proposes and illustrates a general framework to integrate the areas of vision research 
and complex networks. Each image pixel is associated to a network node and the Euclidean distance 
between the visual properties (e.g. gray-level intensity, color or texture) at each possible pair of 
pixels is assigned as the respective edge weight. In addition to investigating the therefore obtained 
weight and adjacency matrices in terms of node degree densities, it is shown that the combination 
of the concepts of network hub and ^-expansion of the adjacency matrix provides an effective means 
to separate the image elements, a challenging task in computer vision known as segmentation. 

PACS numbers: 89.75.Fb, 89.75. He, 85.57.-s, 85.58.Mj 



I. INTRODUCTION 

While effortless to humans and animals, the visual in- 
terpretation of scenes - i.e. the process of vision pj— rep- 
resents one of the greatest challenges to science. Indeed, 
no automated vision system [2j has so far been obtained 
whose performance can be compared even to those of 
the simplest animals. At the same time, relatively lit- 
tle is known about how biological vision operates. Great 
part of the difficulty in understanding and implement- 
ing vision derives from the lack of general concepts and 
approaches to represent, characterize and model the in- 
tricate data and processes underlying visual processing. 
One particularly relevant aspect is the fact that 2D im- 
ages are normally obtained by projecting 3D scenes onto 
some planar surface, implying substantial loss of informa- 
tion and degeneration. One immediate effect of such pro- 
jections is the fact that the spatial adjacency relations are 
abruptly changed, in the sense that two spatially sepa- 
rated objects in the 3D scene may become adjacent in the 
2D projection. At the same time, the superposition of ob- 
jects (i.e. occlusion) may break the image background (or 
another object) into two or more disconnected portions. 
For instance, the pen in Figure |3J a) separates the white 
background into two parts. Another interesting aspect 
is the fact that, because of image resolution constraints 
implied by the size of the photoreceptors, stereo vision 
becomes useless for objects further away than about 100 
meters. Even so, the fact that humans are still fully able 
to interpret such scenes and print pictures clearly indi- 
cates that there are effective solutions to the problem 
of understanding 2D images. One of the hardest prob- 
lems in vision is the process of partitioning the image 
into meaningful connected components, which is known 
as segmentation 013 Despite continuing attention from 
the scientific community, the problem of finding a seg- 
mentation approach capable of producing suitable results 
for general images remains a considerable challenge, as a 
consequence of the degenerations implied by 2D imag- 
ing allied to other problems such as reflexes and shades. 
While fully general and automated image segmentation 



will ultimately depend on the incorporation of high-level 
information about image formation and the objects to be 
expected in image (i.e. good models of the visual world), 
it is argued here that reasonable segmentations can be 
obtained by considering the full range of spatial interac- 
tions between the image elements. 

The interesting problems implied by vision have mo- 
tivated approaches based on areas ranging from partial 
differential equations (e.g. 0) to wavelets (e.g. 
Several physics-based approaches to vision [(| have been 
reported in the literature, including the use of the Potts 
model for image segmentation (e.g. 13), deformable con- 
tours (e.g. Q, random fields (e.g. jj) and fractals (e.g. 
[Io|). While the projection of 2D scenes can be modeled 
as random scalar fields, digital images are normally rep- 
resented in terms of matrices, where each element cor- 
responds to a pixel, which provides a poor representa- 
tion of the original visual information in the sense that 
it does not incorporate any information about spatial in- 
teractions and correlations along several spatial ranges 
between the pixels. Several graph-theoretical approaches 
to image analysis and computer vision, interesting graph- 
based approach to visual representation and analysis \V\\ 
involve the use of graphs where each pixel is associated 
to a node and the difference between the visual proper- 
ties of adjacent pixels are used to define the respective 
edge weights. One particularly interesting graph-based 
method called image foresting transform IFT has 
been described (e.g. [l2T |) where minimal paths are iden- 
tified between pixels considering some spatial adjacency, 
yielding a forest |22l | of minimum paths connecting the 
image pixels. Several useful image properties can be de- 
rived from such graph representations, including the seg- 
mented image regions |l2l | and multiscale skeletons |13|. 

The current work proposes a complex network [Til 
Il5l llq] approach to vision research where each pixel is 
represented as a node and the distance between the vi- 
sual properties (e.g. gray-level, color or texture) of ev- 
ery pair of pixels in the image is adopted as the respec- 
tive edge weight. The distance between the pixels can 
also be considered into the edge weight. The resulting 
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fully-connected graph is subsequently thresholded at a 
specific value T in order to obtain the respective adja- 
cency matrix A It is argued here that, by incor- 
porating all ranges of spatial interactions between the 
image elements, such representations present good po- 
tential for integrating the relevant visual properties from 
the lowest to the highest levels of abstraction, leading to 
more effective visual processing. Moreover, it is also sug- 
gested that the use of the concepts and tools underlying 
complex network research (e.g. node degree, clustering 
coefficient, shortest path, L-expansions and critical phe- 
nomena) can provide relevant information to be used for 
image and object characterization. Such a potential is 
illustrated in this paper with respect to image character- 
ization in terms of node degree densities and the use of 
L-expansions of the graph in order to obtain enhanced 
image segmentation. 



II. IMAGE REPRESENTATION AS A 
COMPLEX NETWORK 

The original gray-level image is assumed to have M x 
M pixels, each of which can take gray- level values ranging 
from to G, which is proportionally normalized into the 
interval [0,1]. Therefore, the network representing the 
image contains N = M 2 pixels and n = N(N — l)/2 
weighted edges, which are represented by the N x N 
weight matrix W. Equations define a possible map- 
ping between each node i of the network and the im- 
age pixels (x,y), where 1 < x,y < M and the function 
mod(a,b) stands for the remainder of a divided by b. 

i = y+(x-l)M (1) 
x = [(k - 1)/M\ + 1 (2) 
y = mod((k-l),M) + l (3) 

Several types of interactions between two pixels can 
be defined respectively to each possible visual attribute, 
such as light intensity, color components, local shape, 
texture, motion, and disparity (stereo), as well as the 
pixels position and spatial adjacency |l8| between pairs 
of pixels. The scalar values derived from such proper- 
ties can be organized into a feature vector / in R N , 
in such a way that each network node i becomes as- 
sociated to a feature vector fi describing the local vi- 
sual properties around the respective image pixel. The 
weight of the edge connecting two nodes i and j is there- 
fore defined as w(j,i) — \\fi — fj\\, where ||a|| denotes 
the Euclidean norm. As such, the edge weight reflects 
the visual dissimilarity between the respective pair of 
pixels. In case the visual features take into account a 
small neighborhood around each pixel, such as is the 
case with texture or motion, the edge weights will quan- 
tify the dissimilarity between local properties around the 
pairs of pixels. Let 5t{cl) be the operator acting over 



the matrix a in such a way that the value one is as- 
signed whenever the respective element of a has absolute 
value larger or equal than the specified threshold T; e.g., 
S 2 (x = (4,-2,0,-3,1)) = (1,1,0,1,0). Thus, the ad- 
jacency matrix A underlying the weighted complex net- 
work r for a specific threshold value T can be obtained as 
A = St(W). The average node degree of T is henceforth 
represented as z. 

The concept of L-expansion of a network, introduced 
in |19|| provides an interesting possibility for identifying 
L-paths along the represented network. The L-expansion 
of a network V consists of the network containing the 
same number of nodes as T where an edge is incorporated 
between nodes i and j whenever a path of length L is 
found to exist between those nodes in the original net- 
work r. Because of the high computational cost implied 
for the calculation of such expansions for large values of 
L, the current work is limited to L — 2, in which case the 
adjacency matrix A2 of the ^-expanded network can be 
immediately obtained from the squared adjacency matrix 
B — A 2 while ignoring the elements in the resulting main 
diagonal, i.e. 

A2(i j) = { B&fi if % ^ ] (4) 
y ,J ' (0 otherwise v ; 



III. IMAGE CHARACTERIZATION IN TERMS 
OF TOPOLOGICAL MEASUREMENTS 

The possibility of using complex network concepts for 
image characterization is illustrated in the following with 
respect to the images in Figures OJa) and IHa). Fig" 
ures^Ia) and (b) presents the dilogarithm representation 
of the node degrees, considering the respective weights, 
for the network obtained for those two images consider- 
ing T = 0.1. The degree of a node i was obtained by 
adding the weights of all edges connected to that node. 
Figures ^c) and (d) show the node degree densities ob- 
tained for the respective adjacency matrices, while the 
node degrees considering ^-expansions of the original net- 
works are shown in Figures ^e) and (f). Interestingly, 
the density obtained for the image in Figure^a) presents 
two peaks, suggesting the existence of two types of inter- 
actions (weak and strong) between the image elements 
in the respective image. The densities obtained for the 
adjacency matrices A indicate a predominance of higher 
node degrees, indicating that the image elements tend 
to be connected by many edges. The effect of the 2- 
expansions in enhancing the connections between pixels 
can be readily observed by comparing Figures fTFeWD 
to (c)-(d). It is clear from this example that rather dif- 
ferent node degree signatures are obtained for the two 
considered images, suggesting that such features can be 
effective for image characterization and classification. 
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FIG. 1: Dilog representation of the density of node degrees, 
considering the edge weights, for the networks derived from 
the images in Figures 01 a) and Figures Ufa) . The densities 
obtained from the respective adjacency matrix A, (c) and 
(d), and A2, (e) and (f). The adopted threshold value was 
T = 0.1. 



IV. HUB-BASED IMAGE SEGMENTATION 

Now we turn our attention to the important and chal- 
lenging problem of image segmentation. One of the pos- 
sible manners to approach such an issue is by consid- 
ering that each segmented region can be understood as 
a connected component of the respective network whose 
nodes are characterized by similar visual properties, such 
as the gray-level relative uniformity inside the objects 
in Figure Ofa). Considering that the network hubs are 
associated to a large number of other nodes, it would 
be reasonable to expect that those nodes share similar 
properties between themselves and also with the respec- 
tive hub. This fact suggests the following algorithm for 
partitioning the network into connected components: 

Obtain the weight matrix W from the image; 
Threshold W so as to obtain the adjacency matrix A; 
For k = 1 to K do 

Identify the node h with the highest node degree 

in A (the hub); 
Join all nodes connected to h, and also this node, 

into a detected cluster C; 
Remove from the adjacency matrix all edges 
between the nodes in C; 

The number of repetitions K corresponds to the num- 
ber of expected clusters. This stop condition can be mod- 
ified in order to consider the number of nodes obtained 
in each cluster, terminating the algorithm when all pix- 
els are labeled or when the remaining number of unla- 
beled pixels is smaller than a threshold value. While the 
weighted edges in the networks representing images pro- 
vide an indication about the dissimilarity degree between 
pairs of pixels, they have been experimentally verified to 
lead to unsuitable clusters. This is illustrated in Fig- 
ure Etb)-(c). The cluster obtained for the first identified 
hub, marked by the cross in Figure |2Ib), was unsuitable 
in the sense that it missed part of the background (the 
black region at the bottom right corner of the image). 
The cluster obtained for the second dominant hub, shown 
in Figure CJc) also missed part of the pen (the diagonal 
black stripe) . This tendency to produce incomplete clus- 
ters is related to the fact that there are several pixels 
indirectly connected to the respective hub which, though 
sharing visual properties, are not included into the clus- 
ter. This problem can be addressed by considering the 
^-expansions of the network, which will connect all pix- 
els united to the hub by a 2-path. At the same time as 
the above problem is reduced, pixels directly connected 
to the hub which are not also connected through at leas 
one D-path are ignored by the expansion. This enhanced 
complex network based segmentation algorithm therefore 
is given as follows 

Obtain the weight matrix W from the image; 
Threshold W so as to obtain the adjacency matrix A; 
Obtain the adjacency matrix A2 for the ^-expanded 
network 



For k = 1 to K do 

Identify the node h with the highest node degree 

in A2 (the hub); 
Join all nodes connected to h, and also this node, 

into a detected cluster C; 
Remove from the adjacency matrix all edges 

between the nodes in C; 

Figure |3 illustrates the potential of the enhanced seg- 
mentation algorithm with respect to the initial image (a), 
which corresponds to a coarse version of the image in 
Figure Eta), which is considered here for the sake of il- 
lustration. The first three identified hubs (crossed pixel) 
and respective clusters (light gray-level values) are shown 
in (d)-(f). The clusters obtained for the two first hubs, 
shown in Figures|2f d) and (e) resulted considerably supe- 
rior to those obtained by using the traditional edges (i.e. 
the clusters in Figures H£b) and (c)). Interestingly, the 
first identified hub, which correspond to the image back- 
ground, implied in a respective segmented region (light 
gray-level) merging the two portions of the background 
which were previously separated. Further examples of 
the potential of the proposed segmentation algorithm are 
given in Figures [31(b) and0| 

In similar fashion to the characterization of whole im- 
ages in terms of complex network measurements pro- 
posed in Section ITTT1 it is possible to apply such an ap- 
proach to characterize the regions produced by segmen- 
tation algorithms such as that described above. This can 
be immediately accomplished from adjacency matrices 
defined for each segmented region. Figure [5] illustrates 
the node degree densities (after ^-expansion) obtained for 
the segmented regions (light gray-level) in Figures Hfd) 
and (e). It is clear from this example that rather distinct 
connecting patterns can be obtained for two different re- 
gions of the same image. Such region-oriented topological 
measurements obtained for the representing network pro- 
vide valuable subsidies for object recognition and texture 
characterization. 




FIG. 2: A coarse gray-level image (a) and the first two hubs, 
marked by crosses in (b)-(c), and respective clusters (light 
gray-level) obtained by considering traditional edges. The 
three identified hubs (d)-(f) and respective clusters obtained 
from the hyperedges defined by the ^-expansions. 



V. CONCLUDING REMARKS 

This paper has illustrated how digital images can be 
effectively represented, characterized and analyzed in 
terms of graphs taking into account several ranges of 
spatial interactions. The potential of such a frame- 
work has been illustrated for image characterization in 
terms of node degree densities and for image segmenta- 
tion by using a combination of the concepts of hub and 
^-expansions of a network. Several future developments 
have been motivated by such results. First, it would be 
interesting to investigate how complex network measure- 
ments such as the node degree densities, clustering coef- 
ficient and minimal length can be used for the character- 
ization of broad classes of images (e.g. landscale scenes, 
textures, medical images, etc.). Another promising pos- 
sibility is to use higher L-expansions in order to further 





FIG. 3: Image containing a coin, a pen and part of a floppy 
disk (a) and respective segmentation (b) obtained for T = 
0.05. 



5 




FIG. 4: Lenna's image (a) and respective segmentation (b) 
obtained for T = 0.05. 
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enhance the proposed segmentation approach. At the 
same time, the concept of hub-oriented clustering con- 
sidered in this work can be immediately extended for 
general pattern recognition, allowing the identification 
of prototypes (i.e. the respective hubs) of each identi- 
fied class. Along similar lines, it would be interesting 
to investigate how alternative partitioning schemes such 
as that reported in |20j behave for image segmentation, 
with prospects for integrating approaches. Other further 
developments include the consideration of the segmented 
regions as spatial networks and the topographical 
integration of networks obtained for distinct image prop- 
erties [2]]]. Finally, it would be particularly interesting 
to check how the topological connections of the networks 
representing digital images change with the progressive 
increase of the threshold T, with the possible occurrence 
of critical phenomena (e.g. percolation). 
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FIG. 5: The node degree densities obtained from the adja- 
cency matrix (after ^-expansion) describing the pixel connec- 
tions inside the first (a) and second (b) obtained segmented 
regions considering the image in Figure |3fa). 
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